Decision-Dependent Distributionally Robust Markov Decision Process Method in Dynamic Epidemic Control

نویسندگان

چکیده

In this paper, we present a Distributionally Robust Markov Decision Process (DRMDP) approach for addressing the dynamic epidemic control problem. The Susceptible-Exposed-Infectious-Recovered (SEIR) model is widely used to represent stochastic spread of infectious diseases, such as COVID-19. While Processes (MDP) offers mathematical framework identifying optimal actions, vaccination and transmission-reducing intervention, combat disease spreading according SEIR model. However, uncertainties in these scenarios demand more robust that less reliant on error-prone assumptions. primary objective our study introduce new DRMDP allows an ambiguous distribution transition dynamics. Specifically, consider worst-case probabilities within decision-dependent ambiguity set. To overcome computational complexities associated with policy determination, propose efficient Real-Time Dynamic Programming (RTDP) algorithm capable computing policies based reformulated accurate, timely, scalable manner. Comparative analysis against classic MDP demonstrates achieves lower proportion infections susceptibilities at reduced cost.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Distributionally Robust Markov Decision Processes

We consider Markov decision processes where the values of the parameters are uncertain. This uncertainty is described by a sequence of nested sets (that is, each set contains the previous one), each of which corresponds to a probabilistic guarantee for a different confidence level so that a set of admissible probability distributions of the unknown parameters is specified. This formulation mode...

متن کامل

Robust partially observable Markov decision process

We seek to find the robust policy that maximizes the expected cumulative reward for the worst case when a partially observable Markov decision process (POMDP) has uncertain parameters whose values are only known to be in a given region. We prove that the robust value function, which represents the expected cumulative reward that can be obtained with the robust policy, is convex with respect to ...

متن کامل

Distributionally Robust Optimization for Sequential Decision Making

The distributionally robust Markov Decision Process approach has been proposed in the literature, where the goal is to seek a distributionally robust policy that achieves the maximal expected total reward under the most adversarial joint distribution of uncertain parameters. In this paper, we study distributionally robust MDP where ambiguity sets for uncertain parameters are of a format that ca...

متن کامل

Robust Markov Decision Processes

Markov decision processes (MDPs) are powerful tools for decision making in uncertain dynamicenvironments. However, the solutions of MDPs are of limited practical use due to their sensitivityto distributional model parameters, which are typically unknown and have to be estimated by thedecision maker. To counter the detrimental effects of estimation errors, we consider robust MDPs...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IISE transactions

سال: 2023

ISSN: ['2472-5854', '2472-5862']

DOI: https://doi.org/10.1080/24725854.2023.2219281